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Abstract 

Recently Guth and Katz [TB] invented, as a step in their nearly complete solution of Erdos's 
distinct distances problem, a new method for partitioning finite point sets in M. d , based on the 
Stone-Tukey polynomial ham-sandwich theorem. We apply this method to obtain new and 
simple proofs of two well known results: the Szemeredi-Trotter theorem on incidences of points 
and lines, and the existence of spanning trees with low crossing numbers. Since we consider 
these proofs particularly suitable for teaching, we aim at self-contained, expository treatment. 
We also mention some generalizations and extensions, such as the Pach-Sharir bound on the 
number of incidences with algebraic curves of bounded degree. 



1 Introduction 

A dramatic breakthrough in discrete geometry took place in November 2010, when Guth and Katz 
[16] completed a project of Elekes, exposed in [13], and established a nearly complete solution of 
Erdos's distinct distances problem [14] . originally posed in 1946. 

In one of the main steps of their analysis, they apply the polynomial ham- sandwich theorem 
of Stone and Tukey [33] to obtain a partition of a finite point set P in M. d with certain favorable 
properties, detailed in Section T2.3I below. The partition is effected by what we call an r -partitioning 
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08-30272, by Grant 2006/194 from the U.S.-Israel Binational Science Foundation, by grant 338/09 from the Israel 
Science Fund, and by the Hermann Minkowski-MINERVA Center for Geometry at Tel Aviv University. 
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polynomial. The removal of the zero set Z of the polynomial partitions space into connected 
components, each containing at most \P\/r points of P. A key feature of the construction is that 
the degree of the polynomial achieving this need not be too high, only 0{r l l d ), and thus the 
interaction of other objects, such as lines or hyperplanes, with Z is under control in some sense. 

In this paper we apply partitioning polynomials in several classical problems of discrete geom- 
etry, mostly planar ones, and we provide new and simple proofs of some well known results. 

Incidences. For a finite set P C M 2 and a finite set L of lines in R 2 , let I(P,L) denote the 
number of incidences of P and L, i.e., of pairs (p, £) with p G P, i G L, and p G i. 

The following fundamental result was first proved by Szemeredi and Trotter in 1983, in response 
to a problem of Erdos [13] • 

Theorem 1.1 (Szemeredi and Trotter |35| ) I(P,L) = 0(m 2 / 3 n 2 / 3 + m + n) for every set P 
of m distinct points in the plane and every set L of n distinct lines. 

We remark that the bound in the theorem is tight in the worst case for all m,n (see |14| . 
for original sources or |24j for a presentation). 

A simpler proof of the Szemeredi-Trotter theorem, based on cuttings, was given by Clarkson et 
al. [8] in 1990, and in 1997 Szekely [M] found a beautiful and elegant proof, based on the crossing 
lemma for graphs embedded in the plane (also see, e.g., [24"]). 

In Section [3] we present an alternative proof based on polynomial partitions, hoping that the 
reader will find it equally simple. We also believe that the new proof is suitable for teaching 
purposes, so our goal is to make the exposition as elementary and self-contained as possible. For 
this we also give proofs of several well known and basic facts about multivariate polynomials. The 
only major ingredient of the analysis which we do not prove is the classical ham-sandwich theorem, 
which we use as a black box (see, e.g., |25] for an exposition). 

The Szemeredi-Trotter theorem has led to an extensive study of incidences of points and curves 
in the plane and of points and surfaces in higher dimensions. A survey of the topic can be found in 
Pach and Sharir [30J. In particular, the following theorem on incidences between points and planar 
curves has been established: 

Theorem 1.2 (Pach and Sharir [29]) Let P be a set of m points and let T be a set of n simple 
curves, all lying in the plane. If no more than C\ curves of V pass through any k given points, and 
every pair of curves of T intersect in at most C2 points, then 

I(P, T) = ( m */(2*-i) n (2*-2)/(»-i) + m + n) , 
with an appropriate constant of proportionality that depends on k,C±,C2. 

A weaker version of this result, where the the curves in T are assumed to be algebraic and 
to belong to a family parameterized by k real parameters, was obtained earlier, also by Pach and 
Sharir [28] (special cases of this result, e.g., for incidences of points and circles, were obtained even 
earlier by Clarkson et al. [8]). 

In Section [U we give a simple proof of a version of Theorem 11.21 with the additional assumption 
that r consists of algebraic curves of degree bounded by a constant. 
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Spanning trees with low crossing number. Let P be a finite set of points in R 2 . A ( geometric) 
graph on P is a graph G with vertex set P whose edges are realized as straight segments connecting 
the respective end-vertices. The crossing number of G is the maximum number of edges that can 
be intersected simultaneously by a line not passing through any point of We will consider 
geometric spanning trees on P, i.e., acyclic connected geometric graphs on P. 

The following result has been established in the late 1980s by Welzl [38] and by Chazelle and 
Welzl [TJ; also see Welzl [39]. 

Theorem 1.3 (Welzl [38], Chazelle and Welzl [7\) Every set of n points in the plane has a 
geometric spanning tree with crossing number 0{^Jn). 

The bound in the theorem is tight up to a multiplicative constant, as the example of a ^/n x yjn 
grid shows. Spanning trees with low crossing number have many applications in discrete and 
computational geometry, including range searching [7], the design of other geometric algorithms 
(see, e.g., [3]), discrepancy theory [26], and approximation [23]. 

The original proof of Theorem 11.31 constructs the tree iteratively, through a process called 
iterative reweighting. In each step several new edges are added, and these are selected using a 
packing argument with balls in a line arrangement (or, alternatively, using a so-called cutting). 
An alternative proof, replacing iterative reweighting with linear programming duality, was recently 
given by Har-Peled [18]. 

In Section [5] we present a new and simple proof of Theorem 11.31 via polynomial partitions. 

Chazelle and Welzl [7j established their result on spanning trees with low crossing number in a 
very general setting, where the points do not lie in the plane, but rather in the ground set of an 
arbitrary set system T . The bound on the crossing number is then expressed in terms of the dual 
shatter function of J 7 . 

At present it seems that the approach with polynomial partitions is not suitable for this level 
of generality. However, some generalizations are possible. First, we have verified that Theorem 11.31 
can be extended to the case where the crossing number is taken with respect to a family of algebraic 
curves of degree bounded by a constant, but we will not pursue this in this paper. 

Second, one can also prove a d-dimensional generalization of Theorem II. 3[ and this we do in 
Section [6l Here we are given a set P of n points in M. d , and consider spanning trees of P, which we 
embed into M. d by drawing their edges as straight segments connecting the respective end-vertices, 
as in the plane. The crossing number of such a tree is the maximum number of its edges that 
are crossed by a hyperplane not passing through any point of P. According to [38], [7J, for every 
n-point set in M. d there exists a straight-edge spanning tree with crossing number O (n 1-1 ^). 

We re-prove this fact using polynomial partitions, similar to the planar case. However, the 
proof is more involved in higher dimensions. Informally, the partition distributes the input points 
evenly among the resulting cells, except that some (in the worst case even all) of the points may 
lie on the zero set Z of the partitioning polynomial, and therefore not belong to any of the subsets. 

We avoid this situation using a perturbation argument. This works for the spanning tree 
construction because there we may assume general position of the input points. For incidence 
problems this assumption cannot be made, and other techniques are needed to handle the points 
on Z. We intend to investigate alternative approaches to handling points on Z in a subsequent 
paper. 

1 The condition of avoiding the points of P is important; for example, if all of the points of P are collinear, then 
the line containing P necessarily intersects all edges. 
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2 Review of tools 



2.1 Preliminaries on polynomials 

Here we recall some standard facts about polynomials. The proofs are given for didactic purposes, 
and can be skipped by more experienced readers. 

Since most of the problems that we study here are planar, we will consider mostly bivariate 
polynomials / = f(x,y) = Yli j 0'ijX l y 3 G but the analysis can easily be extended to d- 

variate polynomials in M. d . The degree of / is deg(/) = maxji + j | a^- 7^ 0}. Let Z(f) = {(x, y) G 
]R 2 I f(x, y) = 0} denote the zero set of /. 

Lemma 2.1 If £ is a line in ]R 2 and f G is of degree at most D, then either £ C Z(f), or 

\inz<j)\<D. 

Proof. Writing £ in parametric form {{u\t + v\, ^i + t^) | t G M}, we get that the points of £f]Z(f) 
are roots of the univariate polynomial g(t) := f{u\t + v\,U2t + ^2), which is of degree at most D. 
Thus, either g is identically 0, or it has at most D roots. □ 

Lemma 2.2 If f € is nonzero and of degree at most D, then Z(f) contains at most D 

distinct lines. 

Proof. We need to know that a nonzero bivariate polynomial (i.e., with at least one nonzero 
coefficient) does not vanish on all of M 2 . (Readers who do not consider this a sufficiently standard 
fact are welcome to work out a quick proof.) 

Now we fix a point pel 2 not belonging to Z{f). Let us suppose that Z{f) contains lines 
£i,...,£k- We choose another line £ passing through p that is not parallel to any £i and not passing 
through any of the intersections £i n £j. (Such an £ exists since only finitely many directions need 
to be avoided.) Then £ is not contained in Z{f) and it has k intersections with (Ji=i ^i- Lemma I2TT1 
yields k < D. □ 

In the proof of Theorem 11.31 (spanning trees with low crossing number), we will also need the 
following result. 

Theorem 2.3 (Harnack's curve theorem |17{ ) Let f G be a bivariate polynomial of 

degree D. Then the number of (arcwise) connected components of Z(f) is at most 1 + ( £, 2 ~ 1 )- ^ e 
bound is tight in the worst case. 

For our application, we actually do not need the precise bound in Harnack's theorem; it suffices 
to know that the number of components is at most 0{D 2 ). For the sake of completeness, we provide 
a short proof of an almost tight bound. 

First we recall, without proof, another basic result in algebraic geometry; see, e.g., (H [9| [TO], 

Theorem 2.4 (Bezout's theorem) Let f,g G be two bivariate polynomials of degrees Df 

and D g , respectively, (a) If the system f = g = has finitely many solutions, then their number 
is at most DfD g . (b) If the system f = g = has infinitely many solutions, then f and g have a 
nontrivial common factor. 
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For a proof of Theorem 1 2. 3 1 we choose a generic direction, and assume, without loss of generality, 
that it is the x-direction. We may assume that / is square-free, because eliminating repeated factors 
of / does not change its zero set. 

Every bounded component of Z(f) has at least two extreme points in the x-direction (that is, 
its leftmost and rightmost points). Such an extreme point has to satisfy / = f y = 0, where f y is 
the partial derivative of / with respect to y. 

Since / is square-free, / and f y have no common factor H and so by Theorem 12.41 the system 
/ = f y = has at most D(D — 1) solutions. Every bounded component consumes at least two of 
these critical points, and hence the number of bounded components is at most \D(D — 1). 

If B is a sufficiently large number, then (again, assuming generic directions of the coordinate 
axes) every unbounded component of Z{f) meets (at least) one of the two lines x = +B and 
x = —B. Thus, there are at most 2D unbounded components, and in total we get a bound of 
\D(D + 1) on all components. 

2.2 The polynomial ham-sandwich theorem 

Here we review the polynomial ham-sandwich theorem of Stone and Tukey [33], the key tool used 
by Guth and Katz in constructing their partitioning polynomials. 

We assume the standard ham-sandwich theorem in the following discrete version: Every d finite 
sets A\,. . . ,Ad C M. d can be simultaneously bisected by a hyperplane. Here a hyperplane h bisects a 
finite set A if neither of the two open halfspaces bounded by h contains more than [|^4|/2j points 
of A. 

From this, it is easy to derive the polynomial ham-sandwich theorem, which we state for bivariate 
polynomials. 

Theorem 2.5 Let Ai, . . . , A s CR 2 be finite sets, and let D be an integer such that (' D ^" 2 ) — 1 > s. 
Then there exists a nonzero polynomial f £ M[x,y] of degree at most D that simultaneously bisects 
all the sets Ai, where "f bisects Ai" means that f > in at most U-Aj|/2j points of Ai and / < 
in at most \\Ai\/2\ points of Ai. 

Proof. We note that ( £> ^ 2 ) is the number of monomials in a bivariate polynomial of degree D, 
or in other words, the number of pairs of nonnegative integers with i + j < D. We set 

k := ( D 2 2 ) ~ 1) an d we let K 2 — > M. k denote the Veronese map, given by 

*(i,y) := (xV) (y)|1 ^^€R fc . 

(We think of the coordinates in M. k as indexed by pairs with 1 < i + j < D.) 

Assuming, as we may, that s = k, we set A\ := $(Aj), i = 1,2, ... ,k, and we let h be 
a hyperplane simultaneously bisecting A^, . . . , A' k . Then h has an equation of the form aoo + 
J2i j o-ijZij = 0, where (%ij)(i,j)\i<.i+j<d are the coordinates in IR fc . It is easy to check that f(x, y) := 
Yli j a ij x% V^ is the desired polynomial (where here the sum includes aoo too). □ 

2 Assume by induction that this is true for polynomials of degree smaller than D, and let / be a square-free 
polynomial of degree D. Assume that / = h ■ g and f y = h-k for some polynomials h, g, and k, where h is not a 
constant. Then f y = h y ■ g + g y ■ h = h ■ k. So h divides h y ■ g. By induction, h and h y have no common factors, and 
so h divides g, contradicting our assumption that / is square-free. 
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2.3 Partitioning polynomials 

In this section we recall the construction of Guth and Katz [16j . specialized to the planar setting 
(our formulation is slightly different from theirs). We also (informally) compare it to older tools of 
discrete geometry, such as cuttings. 

Let P be a set of n points in the plane, and let r be a parameter, 1 < r < n. We say that 
/ G R[x, y] is an r -partitioning polynomial for P if no connected component of M? \ Z(f) contains 
more than n/r points of P. 

In the sequel, we will sometimes call the connected components of M 2 \ Z(f) cells. Let us also 
stress that the cells are open sets. The points of P lying on Z{f) do not belong to any cell, and 
usually they require a special treatment. 

Theorem 2.6 (Polynomial partitioning theorem) For every r > 1, every finite point set P C 
M 2 admits an r -partitioning polynomial f of degree at most 0(y/r). 

Proof. We inductively construct collections Vq,Vi, ■ ■ ., each consisting of disjoint subsets of P, 
such that \Vj\ < 2 3 for each j. We start with Vq '■= {P}- Having constructed Vj, with at most 
2 3 sets, we use the polynomial ham-sandwich theorem to construct a polynomial fj, of degree 
deg(fj) < V2 • 2i , that bisects each of the sets of Vj. Then for every subset Q G Vj, we let Q + 
consist of the points of Q at which fj > 0, and let Q~ consist of the points of Q with fj < 0, and 
we put Vj+i := \J QeVj {Q + ,Q~}- 

Each of the sets in Vj has size at most \P\/2 3 . We let t = |~log 2 r] ; then each of the sets in Vt 
has size at most \P\/r. We set / := /1/2 •••ft- 

By the construction, no component of IR 2 \ Z{f) can contain points of two different sets in Vt, 
because any arc connecting a point in one subset to a point in another subset must contain a point 
at which one of the polynomials fj vanishes, so the arc must cross Z(f). Thus / is an r-partitioning 
polynomial for P. 

It remains to bound the degree: 

* 2 

deg(/) = deg(/i) + deg(/ 2 ) + • • • + deg(/ t ) < V2 ]T 2^/ 2 < — — -2 l l 2 < c^r. 

j=i V 

where c = 2 v / 2/(v / 2- 1) < 7. □ 

A comparison with other partitioning techniques]! The Guth-Katz technique with par- 
titioning polynomials is useful for problems where we deal with a finite point set P and with a 
collection T of lines, algebraic curves, or algebraic varieties in higher dimensions. It provides a 
method of implementing the divide-and-conquer paradigm. 

In the planar case discussed above, the plane is subdivided by Z{f) into some number of (open, 
connected) cells, each containing at most \P\/r points of P. If T consists of lines, then every 7 G T 
intersects at most deg(/) + 1 = 0(y/r) cells (by Lemma |2.1|) . Similarly, for T consisting of algebraic 
curves of degree bounded by a constant, every 7 G T intersects at most 0(y/r) cells by Bezout's 
theorem (Theorem 12. 4j) . Thus, if we define, for every cell d of R 2 \ Z(f), a subset P { C P as the 

3 This part is slightly more advanced and assumes some familiarity with previous techniques used in incidence 
problems. 
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set of points of P contained in Q, and we let Tj consist of the lines or curves of T intersecting Cj, 
then |Pj| < \P\/r for all i, and the average size of the Tj is 0(\T\/y/r). 

There are two earlier partitioning tools in discrete geometry with a similar effect. The first, 
and simpler, kind of them are cuttings (see [6]). A cutting for a collection T of curves in the plane 
subdivides ffi 2 into a collection of connected, simply shaped cells, in such a way that no cell is 
crossed by more than a prescribed fraction of the curves of T. If we again let Pj denote the set of 
points of P in the ith cell, and Tj is the set of the curves intersecting that cell, then this time all 
Ti have size 0(\T\/y/r), and the average of the sizes |Pj| is^l \P\/r. Thus, the behavior of cuttings 
is, in a sense, "dual" to that of polynomial partitions. For many applications, this does not really 
make a difference. 

The second of the earlier tools are simplicial partitions [22] . Here, as in the case of polynomial 
partitions, the plane is subdivided into cells so that |Pj| < \P\/r for each i (where, again, Pj is the 
set of points of P in the ith cell), and no 7 £ T intersects more than 0(y / r) cellsO 

In the plane, as far as we can see, whatever can be done with polynomial partitions, can also 
be achieved through cuttings or through simplicial partitions. The main advantage of polynomial 
partitions is simplicity of the proof. On the other hand, cuttings and simplicial partitions can be 
constructed and manipulated with fairly efficient algorithms, at least in the sense of asymptotic 
complexity, which is not at all clear for polynomial partitions. (For example, finding a ham- 
sandwich cut in a high-dimensional space is a rather costly operation; see |20| for a computational 
hardness result and references.) 

Polynomial partitions may be more powerful than the earlier tools if we pass to a higher- 
dimensional space M. d , d > 2. Asymptotically optimal cuttings and simplicial partitions are known 
to exist in M. d , for every fixed d, in the case where T is a collection of hyperplanes. However, if we 
want to apply analogous methods to construct cuttings (or simplicial partitions, whose construction 
needs cuttings as a subroutine) for T consisting of algebraic surfaces of degree bounded by a 
constant, say, then there is a stumbling block. In one of the steps of the construction, we have a 
collection V of m surfaces from T. It is known that these surfaces partition M. d into 0(m d ) cells, 
but we need to further subdivide each cell into subcells, so that each of the resulting subcells can 
be described by a constant number of real parameters. There is no known general solution that 
achieves 0(m d ) subcells in total, which is the optimal bound one is after for most applications. 
For d = 3,4, the situation is still not bad, since bounds only slightly worse than 0(m d ) have been 
proved, but for d > 5, the best bound is of order roughly m 2d_4 , and so for large d, the exponent 
is almost twice larger of what it probably should be (see pQ for a more detailed discussion). The 
new approach with polynomial partitions might hopefully be able to bypass this stumbling block, 
at least in non-algorithmic applications. 

3 Proof of the Szemeredi— Trotter theorem 

We recall that we are given a set P of m distinct points and a set L of n distinct lines in the plane 
and we want to bound the number of incidences I(P,L). 

We begin with a simple observation (appearing in most of the previous proofs). 

4 Here we choose the parameterization so that it agrees with the one for polynomial partitions; the usual notation 
in the literature would use r for our \fr. 

5 In the original version of simplicial partitions [55], the cells cover R 2 , but they need not be disjoint. In a newer 
version due to Chan [5] , disjointness can also be guaranteed. 
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Lemma 3.1 I(P, L) < n + m 2 . 

Proof. We divide the lines of L into two subsets: the lines in V are incident to at most one point 
of P, while the lines in L" pass through at least two points. 

Obviously, I(P,L') < \L'\ < n. In order to bound I{P,L"), we note that a point p £ P may 
have at most m — 1 incidences with the lines of L" , since there are at most m — 1 lines passing 
through p and some other point of P. Thus, I(P, L") < m(m — 1) < m 2 . □ 

Let us remark that this lemma also follows from the K6vari-S6s-Turan theorem [2T] concerning 
graphs with forbidden complete bipartite subgraphs. In the above argument, we are really proving 
the required instance of Kdvari-S6s-Turan. 

Proof of the Szemeredi Trotter theorem. For simplicity, we first do the proof for m = n, 
and then indicate the changes needed to handle an arbitrary m. 

We set r := n 2 / 3 , and we let / be an r-partitioning polynomial for P. By the polynomial 
partitioning theorem (Theorem I2.6p . we may assume D = deg(/) = 0(y/r ) = 0(n 1//3 ). 

Let Z := Z(f), let C±, . . . , C s be the connected components of M 2 \ Z, let Pj := P n Cj, and let 
Pq := P n Z. Since / is an r-partitioning polynomial, we have |Pj| < n/r = n 1 / 3 , i = 1,2, ... ,s. 
Furthermore, let Lq C L consist of the lines of L contained in Z; we have |Lo| ^ D by Lemma [2.21 

We decompose 

s 

I(P, L) = I(P , L ) + J(P , L\L ) + J2 L )- 

i=l 

We can immediately bound 

I(P ,L ) < \L \ ■ \P \ < \Lo\n < Dn = 0(n 4 / 3 ), 

and 

I(P ,L\L ) < \L\L \D = O(n A l z ), 

since each line of L \ Lq intersects Z, and thus also Pq, in at most D = deg(f) points. 

It remains to bound ^i=i I{Ph L). Let Li C L be the set of lines containing at least one point 
of Pi (the Li are typically not disjoint). By Lemma 13. II we get 

s s 
i=l i=l 

We have Y2t=i I A I = 0((D + l)n) = 0(n 4 / 3 ), since by Lemma l2.1| no line intersects more than 
D + 1 of the sets P*. Finally, £f =1 |P| 2 < (max^ |P|) • |P| < 2 • n = 0(n 4 / 3 ). This finishes 

the proof for the case m = n. 

We generalize the proof for an arbitrary m as follows. We may assume, without loss of generality, 
that m < n; the complementary case is handled by interchanging the roles of P and L, via a 
standard planar duality. We may also assume that y/n < m, since otherwise, the theorem follows 
from Lemma [3.11 Then we set r := m 4 / 3 /n 2//3 . Noting that 1 < r < m for the assumed range of m, 
we then proceed as in the case m = n above. We get D = deg(/) = 0(m 2 / 3 /n 1 / 3 ), and we check 
that all the partial bounds in the proof are at most 0(m 2 / 3 n 2//3 ). □ 
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4 Incidences of points with algebraic curves 



As was announced in the introduction, we prove the following (weaker) version of Theorem 11.21 

Theorem 4.1 Let b,k and C be constants, let P be a set of m points in the plane, and let V be a 
family of planar curves such that 

(i) every 7 £ T is an algebraic curve of degree at most b, and 

(ii) for every k distinct points in the plane, there exist at most C distinct curves in T passing 
through all of them. 

Then I(P, T) = O (jn k ^ 2k ~ 1 ^ n^ 2k ~ 2 ^ ^ 2k ~ 1 ^ + m + n), with the constant of proportionality depending 
on b, k, C . 

In the proof, we may assume that the curves in V are irreducible^ Indeed, if it is not the case, 
we apply the forthcoming analysis to the irreducible components of the curves of T, whose number 
is at most bm. 

We begin with an analog of Lemma 13.11 

Lemma 4.2 Under the conditions of Theorem \4-l\ we have I(P,F) = 0(n + m k ), and also 
I(P,T) = 0(m + n 2 ); the constants of proportionality depend on b,k,C. 

Proof. For the first estimate, we distinguish between the curves with fewer than k incidences, 
which altogether generate 0(n) incidences, and curves with at least k incidences, observing that 
there are at most C(TZi) such curves through each point of P. 

For the second estimate, we first note that, by the assumed irreducibility and by Bezout's 
theorem (Theorem l2.4p . every pair of curves of V intersect in at most b 2 points. Then we distinguish 
between points lying on at most one curve each, which have 0{m) incidences altogether, and the 
remaining points, each lying on at least two curves. Now a single 7 E T has at most b 2 (n — 1) 
intersections with the other curves, and thus it contributes at most b 2 (n — 1) incidences with these 
latter points. So I(P, V) = 0(m + n 2 ) follows. □ 

Proof of Theorem 14.11 We may assume m < n 2 and n < m k , for otherwise, the bounds of 
Lemma 14.21 give I(P, T) = 0(m + n). 

We set r := m 2k ^ 2k ~ 1 ^ /n 2 /^ 2fc_1 \ and we observe that our assumptions on m, n yield 1 < r < m. 
Let / be an r-partitioning polynomial for P, of degree 

deg(/) = O(VF) = O^f^-^/n 1 '^-^). 

The proof now continues in much the same way as the proof of the Szemeredi-Trotter theorem. 

We put Z := Z(f), let Pq := P n Z, and let To C T consist of the curves fully contained in Z. 
Since every 7 G Tq is irreducible, it must be a zero set of a factor of / (this follows from Bezout's 
theorem), and so | To I < deg(/) = 0(y/r). Hence I(Po, To) = 0(m + | To 1 2 ) = 0(m + r) = 0(m) by 
the second bound of Lemma [4. 21 (Here the argument differs from the one for the Szemeredi-Trotter 

6 We recall that a planar algebraic curve 7 is irreducible if 7 = Z(g) for an irreducible polynomial g, i.e., one that 
cannot be written as g — g\gi with both gi,g2 nonconstant (and real in our case). For 7 = Z(g) with g arbitrary, we 
can write g = giQ2 ■ • • gk as a product of irreducible factors, and the irreducible components of g are Z(gi),. . . , Z(gk)- 
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theorem — in the latter, it was sufficient to use the trivial bound I{Pq,Lq) < \Pq\ ■ \Lq\, which in 
general is not sufficient here.) 

Next, we consider 7 G r \ Tq. Applying Bezout's theorem to 7 and every irreducible com- 
ponent of Z in turn, we see that (7 n Z\ < b ■ deg(/) = 0(^/r). So I(P ,F \ T ) = 0(n^/r) = 

Q( m k/(2k-l) n (2k-2)/(2k-l)\ _ 

Letting C\, . . . ,C S be the connected components of R 2 \ Z, it remains to bound Ylt=i I(Pi->Fi), 
where Pi = P n Cj and Tj is the set of curves meeting Cj. By Bezout's theorem once again, we 
have X^=i = 0( n ' deg(/)) = 0{n\/r). Then, by the first bound of Lemma [472], we obtain 

s s k 1 S 

^/(P i ,r i ) = o(X;(|r i | + |P i | fc )) <0(n v ^) + (max i |P|) " o(^|P|) 

i=l i=l i=l 

= O^+Wr)*" 1 ™) = 0(m fc /( 2fc - 1 )n( 2fc - 2 )/( 2fc - 1 )). 

□ 



5 Spanning trees with low crossing number in the plane 

In the forthcoming proof of Theorem 1 1.31 instead of constructing a geometric spanning tree directly, 
it will be more natural to construct an arcwise connected set X, made of segments and algebraic 
arcs, that has a low crossing number and contains the given point set P. Here we say that a 
set X C M. d has crossing number at most k if each line, possibly with finitely many exceptions, 
intersects X in at most k points. (It is easy to check that for a geometric spanning tree, this new 
definition is equivalent to the earlier one.) 

The following lemma allows us to convert such an X into a geometric spanning tree. Although 
we are not aware of an explicit reference for the statement we need, most of the ideas of the proof 
appear in the literature in some form. 

Lemma 5.1 Let P be a set of n points in the plane, and let X be an arcwise connected set con- 
taining P, with crossing number at most k. Then there exists a (geometric) spanning tree of P 
whose edges are straight segments and whose crossing number is at most 2k. 

Proof. In the first stage of the proof we build a Steiner tree S for P, whose edges are arcs contained 
in X. We order the points of P arbitrarily, into a sequence pi,P2, ■ ■ ■ ,Pn- We set Si = {pi}, and, 
having built a Steiner tree Si C X for {pi, . . . ,pi}, we choose an arc a, connecting pi + i to some 
point qi of Si, in such a way that Qj D Si = Then we set Si + i := Si U Oj. Having reached 

i = n, we set S := S n ; see Fig. [0 The crossing number of S is at most k since SCI, 

In the second stage, we replace arcs by straight segments. Namely, the points qj divide S into 
finitely many subarcs, and we replace each of them by a straight segment connecting its endpoints. 
It is easily seen (and standard) that the crossing number does not increase. This yields a Steiner 
tree for P whose edges are straight segments. 

In the third and last stage, we eliminate the Steiner points and obtain a spanning tree, at the 
price of at most doubling the crossing number. This is done by performing an inorder traversal of 
the tree, starting from some arbitrary root vertex, tracing each edge in both directions, skipping 
over the Steiner points, connecting each pair of consecutively visited points of P by a straight 
segment, and finally eliminating cycles in the resulting tour. □ 

The main step in the proof of Theorem 11.31 is the following lemma. 
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Figure 1: Illustrating the proof of Lemma 15. ll Left: Building a Steiner tree from arcs. Right: Shortcutting the 
arcs into segments. 

Lemma 5.2 Let P be a set of n points in the plane. Then there exists a set ICR 2 that contains 
P, has at most n/2 arcwise connected components, and with crossing number 0(y/n). 

Proof. If n is below a suitable constant, we can interconnect the points of P by an arbitrary 
geometric spanning tree, and so we may assume that n is large. 

We apply the polynomial partitioning theorem (Theorem l2.6p . to obtain an r-partitioning poly- 
nomial / for P, with r as large as possible but so that Z := Z{f) is guaranteed to have at most n/2 
connected components. By Theorem 12.61 we have deg(/) = 0(y/r), and so, by Harnack's theorem 
(Theorem I2.3H . we can afford to take r = n/c for a suitable constant c. 

Then, for every p £ P not lying in Z, we pick a straight segment a p connecting p to a point of Z 
(and otherwise avoiding Z). We let X := ZU{J peP \ z o~ p . Clearly, X has at most n/2 components, 
and it remains to bound its crossing number. 

Let £ be a line that is not contained in Z and that does not contain any of the segments a p 
(these conditions exclude only finitely many lines). It intersects Z in at most deg(/) = 0(\/n) 
points, and so it remains to bound the number of the segments a p intersected by I. 

Since / is an r-partitioning polynomial for P, no component of ]R 2 \ Z contains more than 
c points of P. The line I meets at most 1 + deg(/) components, and so it intersects at most 
c(l + deg(/)) = 0{yfn) of the segments a p . The lemma is proved. □ 

Proof of Theorem II. 3L In view of Lemma 15.11 it suffices to construct an arcwise connected set 
X containing P, with crossing number 0(y/n). To this end, we apply Lemma 15.21 recursively. 

We construct a sequence Bq, B\, P>2, ■ ■ ■ of sets, such that each Bi contains P and has at most 
n/2* arcwise connected components. We begin with Bq := P, and, having constructed Bi, we choose 
a point in each of its components, which yields a set Ri of at most n/2* points. Lemma 15.21 then 
provides us with a set Xi I) Ph with at most n/2 t+l components and crossing number 0(\f n/2 1 ). 
We set -Bj+i := Bi U Xi and continue with the next iteration, until for some in we reach an arcwise 
connected Bi Q , which we use as X. The crossing numbers of the Xi are bounded by a geometrically 
decreasing sequence, and so X has crossing number bounded by its sum, which is 0(y/n), as 
required. □ 

6 Spanning trees in higher dimensions 

Here we prove the higher-dimensional generalization of Theorem 11.31 mentioned in the introduction. 

Theorem 6.1 Every set P ofn points in M. d admits a geometric spanning tree with crossing number 
(w.r.t. hyperplanes) at most C ( in 1 ~ l / d , with a sufficiently large constant depending on d. 
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When one tries to extend the planar proof from Section \5\ with the appropriate higher-dimensional 
analogs of the polynomial partition lemma and Harnack's theorem (discussed below), a problem 
arises when almost all the points of P happen to lie on the zero set Z(f) of the partitioning poly- 
nomial. (This situation seems hard to avoid — for example, P may lie on a low-degree algebraic 
variety, in which case the zero set of each of the bisecting polynomials would simply coincide with 
this variety.) 

In the planar case this did not matter, since we could use Z(f) itself as a part of the connecting 
set X. However, in higher dimension, we cannot take all of Z(f) (which is typically a (d — 1)- 
dimensional object), so we would still need to construct a suitable connecting set with low crossing 
number within Z(f). 

Fortunately, the spanning tree problem behaves well with respect to small perturbations. Namely, 
it is easy to see (and well known) that the crossing number of a geometric spanning tree cannot 
change by a (sufficiently small) perturbation of its vertex set, and this will allow us to avoid the 
situation with too many points on Z(f). Before carrying out this plan, we first summarize and 
review the additional tools we need, beyond those already covered. 

6.1 Additional tools 

The polynomial ham sandwich theorem (Theorem 12. 5p and the polynomial partitioning theorem 
(Theorem 12 .6p immediately generalize to with the (i-variate r-partitioning polynomial / having 
degree 0(r l / d ) (this relies on the fact that the number of monomials of degree D in d variables is 
{ D ^ d ), so the degree will be the smallest integer satisfying ( D ^ d ) — 1 > r). 

We will also need a kind of generalization of Harnack's theorem, dealing with components of 
the complement of Z(f), rather than with the components of Z(f): 

Lemma 6.2 Let f be a real polynomial of degree D in d variables. Then the number of connected 
components ofM. d \ Z(f) is at most 6(2D) d . 

This follows, for example, from Warren [371 Theorem 2] (also see [4] for an exposition, and [2] 
for a neatly simplified proof). 

We also note that if / is as in Lemma [6. 2 1 and h is a hyperplane in M. d , then h\Z(f) has at most 
6(2D) d ~ 1 connected components, and consequently, h intersects at most that many components of 
M. d \ Z(f). Indeed, this is clear from Lemma 16.21 if h is the coordinate hyperplane x^ = 0, and the 
general case follows by a linear transformation of coordinates. 

6.2 A general position lemma 

We need the following lemma, which is probably known, but unfortunately we do not have a 
reference at the moment. 

Lemma 6.3 Let d, D be given integers, and let k := ( D j d ) — 1- Let P = (pi, ■ ■ ■ ,Pk+i) be an 
ordered (k + l)-tuple of points in M. d . Let us call P exceptional if it is contained in the zero set of 
a nonzero d-variate polynomial of degree at most D. Then there is a nonzero polynomial ip = ijjd,D 
with integer coefficients in the variables Zij, 1 < i < k + 1, 1 < j < d, such that all exceptional 
(k + l)-tuples (p±, . . . ,Pk+i) belong to the zero set of tp (that is, if we set z^ to the jth coordinate 
of pi, for all i,j, then ip evaluates to 0). 
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Proof. The value of k in the lemma is the number of nonconstant monomials of degree at most 
D in the d variables x\, . . . , Xd- Let fj,±, ni% ■ ■ ■ , Mfe be an enumeration of these monomials in some 
fixed order. 

It is convenient to phrase the argument using the Veronese map M. d — > R fc , which we en- 
countered in Section [2T21 for the special case d = 2. For x = (xi, . . . ,Xd) £ we can write 
= (^( x ) | i = l, 2, . . . , k) e R k . 

As in the proof of Theorem 12.51 the zero set Z(f) of a polynomial / of degree at most D 
can be written as where h is a suitable hyperplane in M fc . Thus, the condition for a 

sequence P = (pi, . . . ,Pk+i) of points in M. d to be exceptional is equivalent to the k + 1 points 
3>(pi), . . . , &(pk+i) lying on a common hyperplane in 

The condition that k + 1 points qi, . . . , (/fc+i in R fc lie on a common hyperplane can be expressed 
by the vanishing of a suitable determinant in the coordinates of q\, . . . , qt+i- Namely, it is equivalent 
to det(^4) = 0, where 



A = A(q 1 ,...,q k+1 ) 



We define the desired polynomial ip by 



/ 1 
1 



921 



<7l2 
922 



Q2k 



\ 1 <?(fc+l)l Q(k+1)2 



1(k+l)k I 



ip = ^(zn, z ik+ i)d) ■= det(A(*(afi), $(z 2 ), . . . , $(z fc +i))), 

where = (zji, . . . , Zidj- Clearly, ip has integer coefficients, and by the above, it vanishes on all 
exceptional sequences; it remains to verify that it is not identically 0. 

Assuming the contrary, it means that the images of any k + 1 points under the Veronese map lie 
on a common hyperplane in Mr. This in turn implies that all of is contained in a hyperplane. 

But the $-preimage of every hyperplane is the zero set of some nonzero polynomial, and thus it 
cannot be all of M. d (extending the observation in the proof of Lemma 12.21 to higher dimensions) 
The resulting contradiction proves the lemma. □ 



6.3 Proof of Theorem 16.11 

Given a finite point set P C ffi d , we first perturb each point slightly, obtaining a new set P', for 
which we may assume that the coordinates of its points are algebraically independent (i.e., they do 
not satisfy any nontrivial polynomial equation with integer coefficients) |f] 

In particular, for every D = 1, 2, . . ., if we set k := ( J ) — 1 as in Lemma 16.31 then no (k + 1)- 
tuple of points of P' is contained in Z(f), for any nonzero polynomial / of degree at most D. 

By the observation mentioned at the beginning of Section El it suffices to exhibit a geometric 
spanning tree with crossing number 0(n 1_1//<i ) for P' . 

7 Another way to see that ip is not identically zero is to consider two terms t\ and ti in the expansion of 
det(A($(zi), <E>(z2), . . • , $(zfc+i))). There is a row i of A($(zi), $(.82), ■ • ■ , 3>(zfc+l)) from which ti and t% contain 
different elements, ft follows that t\ and ti must contain a different power of one of the variables zn, ■ ■ ■ , 2«j- We 
get that all terms in the expansion of det(A(&(zi), $(2:2), ■ ■ ■ , <&(zfc+i))) are are different monomials, so ip cannot be 
identically zero. 

8 The existence of such P' is well known and follows, e.g., by a standard measure argument via Sard's theorem 
(which guarantees that the zero set of every nonzero multivariate polynomial has zero measure; see, e.g., |31l I32j ). 
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Moreover, it suffices to show that there exists a geometric graph G on the vertex set P' with at 
most n/2 components and with crossing number 0(n 1_1 / d ); the existence of the desired spanning 
tree then follows by recursion on the size of P' (as in the proof of the planar case). 

So we set r := n/c for a sufficiently large constant c > 0, and construct an r-partitioning 
polynomial / for P', of degree D = 0(r l l d ); thus, no component of M. d \ Z contains more than c 
points of P' , where Z = Z{f). 

By the algebraic independence of P', and by Lemma 16.31 Z contains fewer than { D ^ d ) = 0(r) 
points of P', with a constant of proportionality depending only on D. For c sufficiently large, we 
thus have |Pq| < j, where Pq := P' n Z. By Lemma 16.21 we may also assume that for c sufficiently 
large, M d \ Z has at most f components. 

For each component U of M d \ Z, we now interconnect the points of U fl P' by an arbitrary 
geometric spanning tree Tjj. The geometric graph G is the union of all the trees Tjj and the points 
of -Pq (which appear as isolated vertices in G). The number of connected components of G is at 
most 5 (one for each Tjj and one for each point of Pq). It remains to bound its crossing number. 

To this end, let h be a hyperplane avoiding P', and let us consider an edge {p, q} of G crossed 
by h. This edge belongs to some Tjj, and so the points p and q lie in the same component U of 
M. d \ Z. Considering an arc a C U connecting p to q, we see that h has to intersect a and thus U 
too. By the remark following Lemma 16.21 h intersects at most 0(D d ~ 1 ) = O^i 1 ^ 1 ^) components 
of M. d \ Z, and within each such component U it meets at most c edges of G (that is, of Tjj). Hence 
the crossing number of G is 0(n l ~ l l d ), as claimed. □ 

7 Conclusion 

We regard this paper as an initial stepping stone in the development of applications of the new 
algebraic machinery of Guth and Katz. It is encouraging that this technique can replace more 
traditional approaches and yield simpler proofs of central theorems in combinatorial geometry. 

Of course the real challenge is to use the techniques to obtain improved solutions to other 
"hard Erdos problems in discrete geometry" (borrowing from the title of [313)) as Guth and Katz 
themselves did, first for the joints problem in [15] and then for the harder distinct distances problem 
in [16] . There is a long list of candidate problems, of varying degree of difficulty. Perhaps the hardest 
in the list is the planar unit distances problem of Erdos: What is the maximum possible number 
of unit distances determined by a set of n points in the plane? This problem seems to require an 
algebraic approach, mainly because the best known upper bound, (3(n 4//3 ), is known to be tight if 
the norm is not Euclidean, as shown by Valtr [36] . 

In closing, one should note that the algebraic approach used in this paper has also some disad- 
vantages. For one, it seems to require the objects to be algebraic or semialgebraic. For example, 
the Szemeredi- Trotter theorem can easily be extended to yield the same bound on the number of 
incidences of points and pseudolines, using, e.g., the combinatorial proof technique of Szekely [34] . 
but such an extension does not seem to follow from the polynomial partitioning technique. The 
same situation occurs in the setup of Theorem 1 1.2 1 where the general situation considered there can 
be handled by traditional combinatorial tools, but not by the algebraic machinery, which can only 
establish weaker variants, like the one in Theorem 14.11 Perhaps some abstract version of polynomial 
partitions, yet to be discovered, might combine the advantages of both approaches. 
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